Learning to compress videos without computing motion

نویسندگان

چکیده

With the development of higher resolution contents and displays, its significant volume poses challenges to goals acquiring, transmitting, compressing, displaying high-quality video content. In this paper, we propose a new deep learning compression architecture that does not require motion estimation, which is most expensive element modern hybrid codecs like H.264 HEVC. Our framework exploits regularities inherent motion, capture by using displaced frame differences as representations train neural network. addition, space-time reconstruction network based on both an LSTM model UNet model, call LSTM-UNet. The has three components: Displacement Calculation Unit (DCU), Compression Network (DCN), Frame Reconstruction (FRN). DCU removes need for estimation found in less expensive. DCN, RNN-based utilized compress well retain temporal information between frames. LSTM-UNet used FRN learn differential videos. experimental results show our MOtionless VIdeo Codec (MOVI-Codec), learns how efficiently videos without computing motion. experiments MOVI-Codec outperforms Low-Delay P veryfast setting coding standard exceeds performance global HEVC codec, same setting, measured MS-SSIM, especially latest H.266 (VVC) codec at bitrates, when assessed high-resolution

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning to Compress Ergodic Sources

We present an adaptive coding technique which is shown to achieve the optimal coding in the limit as the size of the text grows, while the data structures associated with the code only grow linearly with the text. The approach relies on Huffman codes which are generated relative to the context in which a particular character occurs. The Huffman codes themselves are inferred from the data that h...

متن کامل

Learning to Extract Motion from Videos in Convolutional Neural Networks

This paper shows how to extract dense optical flow from videos with a convolutional neural network (CNN). The proposed model constitutes a potential building block for deeper architectures to allow using motion without resorting to an external algorithm, e.g. for recognition in videos. We derive our network architecture from signal processing principles to provide desired invariances to image c...

متن کامل

Cloud Computing; A New Approach to Learning and Learning

Introduction: The cloud computing and services, as a technological solution for developing educational services, can accelerate the provision and expansion of these highly useful services. This study intended to provide an overall picture of practical areas of learning services based on cloud computing teaching and learning equipment. Methods: This was a theoretical hybrid research study in whi...

متن کامل

Jointly Learning to Extract and Compress

We learn a joint model of sentence extraction and compression for multi-document summarization. Our model scores candidate summaries according to a combined linear model whose features factor over (1) the n-gram types in the summary and (2) the compressions used. We train the model using a marginbased objective whose loss captures end summary quality. Because of the exponentially large set of c...

متن کامل

A Temporal-Compress and Shorter SIFT Research on Web Videos

The large-scale video data on the web contain a lot of semantics, which are an important part of semantic web. Video descriptors can usually represent somewhat the semantics. Thus, they play a very important role in web multimedia content analysis, such as Scale-invariant feature transform (SIFT) feature. In this paper, we proposed a new video descriptor, called a temporalcompress and shorter S...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Signal Processing-image Communication

سال: 2022

ISSN: ['1879-2677', '0923-5965']

DOI: https://doi.org/10.1016/j.image.2022.116633